944 research outputs found

    An End-to-End License Plate Localization and Recognition System

    Get PDF
    An end-to-end license plate recognition (LPR) system is proposed. It is composed of pre-processing, detection, segmentation and character recognition to find and recognize plates from camera based still images. The system utilizes connected component (CC) properties to quickly extract the license plate region. A novel two-stage CC filtering is utilized to address both shape and spatial relationship information to produce high precision and recall values for detection. Floating peak and valleys (FPV) of projection profiles are used to cut the license plates into individual characters. A turning function based method is proposed to recognize each character quickly and accurately. It is further accelerated using curvature histogram based support vector machine (SVM). The INFTY dataset is used to train the recognition system. And MediaLab license plate dataset is used for testing. The proposed system achieved 89.45% F-measure for detection and 87.33% accuracy for overall recognition rate which is comparable to current state-of-the-art systems

    Text Detection in Natural Scenes and Technical Diagrams with Convolutional Feature Learning and Cascaded Classification

    Get PDF
    An enormous amount of digital images are being generated and stored every day. Understanding text in these images is an important challenge with large impacts for academic, industrial and domestic applications. Recent studies address the difficulty of separating text targets from noise and background, all of which vary greatly in natural scenes. To tackle this problem, we develop a text detection system to analyze and utilize visual information in a data driven, automatic and intelligent way. The proposed method incorporates features learned from data, including patch-based coarse-to-fine detection (Text-Conv), connected component extraction using region growing, and graph-based word segmentation (Word-Graph). Text-Conv is a sliding window-based detector, with convolution masks learned using the Convolutional k-means algorithm (Coates et. al, 2011). Unlike convolutional neural networks (CNNs), a single vector/layer of convolution mask responses are used to classify patches. An initial coarse detection considers both local and neighboring patch responses, followed by refinement using varying aspect ratios and rotations for a smaller local detection window. Different levels of visual detail from ground truth are utilized in each step, first using constraints on bounding box intersections, and then a combination of bounding box and pixel intersections. Combining masks from different Convolutional k-means initializations, e.g., seeded using random vectors and then support vectors improves performance. The Word-Graph algorithm uses contextual information to improve word segmentation and prune false character detections based on visual features and spatial context. Our system obtains pixel, character, and word detection f-measures of 93.14%, 90.26%, and 86.77% respectively for the ICDAR 2015 Robust Reading Focused Scene Text dataset, out-performing state-of-the-art systems, and producing highly accurate text detection masks at the pixel level. To investigate the utility of our feature learning approach for other image types, we perform tests on 8- bit greyscale USPTO patent drawing diagram images. An ensemble of Ada-Boost classifiers with different convolutional features (MetaBoost) is used to classify patches as text or background. The Tesseract OCR system is used to recognize characters in detected labels and enhance performance. With appropriate pre-processing and post-processing, f-measures of 82% for part label location, and 73% for valid part label locations and strings are obtained, which are the best obtained to-date for the USPTO patent diagram data set used in our experiments. To sum up, an intelligent refinement of convolutional k-means-based feature learning and novel automatic classification methods are proposed for text detection, which obtain state-of-the-art results without the need for strong prior knowledge. Different ground truth representations along with features including edges, color, shape and spatial relationships are used coherently to improve accuracy. Different variations of feature learning are explored, e.g. support vector-seeded clustering and MetaBoost, with results suggesting that increased diversity in learned features benefit convolution-based text detectors

    Supersolid and pair correlations of the extended Jaynes-Cummings-Hubbard model on triangular lattices

    Full text link
    We study the extended Jaynes-Cummings-Hubbard model on triangular cavity lattices and zigzag ladders. By using density-matrix renormalization group methods, we observe various types of solids with different density patterns and find evidence for light supersolids, which exist in extended regions of the phase diagram of the zigzag ladder. Furthermore, we observe strong pair correlations in the supersolid phase due to the interplay between the atoms in the cavities and atom-photon interaction. By means of cluster mean-field simulations and a scaling of the cluster size extending our analysis to two-dimensional triangular lattices, we present evidence for the emergence of a light supersolid in this case also.Comment: 11 pages, 16 figure

    BCS-like disorder-driven instabilities and ultraviolet effects in nodal-line semimetals

    Full text link
    We study the effects of quenched disorder on electrons in a 3D nodal-line semimetal. Disorder leads to significant renormalisations of the quasiparticle properties due to ultraviolet processes, i.e. processes of scattering in a large band of momenta, of the width exceeding the inverse mean free path. As a result, observables such as the density of states and conductivity exhibit singular behaviour in a broad range of disorder strengths, excluding a small vicinity of the singular point. We find that, for example, the density of quasiparticle states diverges as a function of the disorder strength gg as ρ(g,E)gc(E)g2E\rho(g,E)\propto |g_c(E)-g|^{-2}|E| for gg smaller than the critical value gc(E)g_c(E) and crosses over to a constant for gg very close to gc(E)g_c(E), where EE is the quasiparticle energy. For certain disorder symmetries, a 3D disordered nodal-line semimetal can be mapped to a 2D metal with attractive interactions. The described disorder-driven instabilities in such a nodal-line semimetal are mapped to Cooper and exciton-condensation instabilities in a 2D metal. For other disorder symmetries, the respective instabilities are similar but not exactly dual. We discuss experimental conditions favourable for the observation of the described effects.Comment: 21 pages, 7 figure

    Knowledge-Driven Distractor Generation for Cloze-style Multiple Choice Questions

    Full text link
    In this paper, we propose a novel configurable framework to automatically generate distractive choices for open-domain cloze-style multiple-choice questions, which incorporates a general-purpose knowledge base to effectively create a small distractor candidate set, and a feature-rich learning-to-rank model to select distractors that are both plausible and reliable. Experimental results on datasets across four domains show that our framework yields distractors that are more plausible and reliable than previous methods. This dataset can also be used as a benchmark for distractor generation in the future.Comment: To appear at AAAI 202
    corecore